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Abstract: The cytochrome P450 (CYP)3A4 enzyme affects the metabolism of most drug-like 
substances, and its inhibition may influence drug safety. Modulation of C YP3 A4 by flavonoids, 
such as anthocyanins, has been shown to inhibit the mutagenic activity of mammalian cells. 
Considering the previous investigations addressing CYP3 A4 inhibition by these substances, we 
studied the three-dimensional quantitative structure-activity relationship (3D-QS AR) in a series 
of anthocyanin derivatives as CYP3A4 inhibitors. For the training dataset (n=12), compara- 
tive molecular field analysis (CoMFA) and comparative molecular similarity index analysis 
(CoMSIA) yielded crossvalidated and non-crossvalidated models with a q 2 of 0.795 (0.687) and 
r 2 of 0.962 (0.948), respectively. The models were also validated by an external test set of four 
compounds with r 2 of 0.821 (CoMFA) and r 2 of 0.812 (CoMSIA). The binding affinity modes 
associated with experimentally derived IC 50 (half maximal inhibitory concentration) values 
were confirmed by molecular docking into the CYP3A4 active site with r 2 of 0.66. The results 
obtained from this study are useful for a better understanding of the effects of anthocyanin 
derivatives on inhibition of carcinogen activation and cellular DNA damage. 
Keywords: three-dimensional quantitative structure-activity relationship, cytochrome P450 
3A4, comparative molecular field analysis, comparative molecular similarity index analysis, 
anthocyanin derivatives, molecular docking, carcinogen activation 
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Introduction 

Cytochrome P450 (CYP) proteins play important roles in metabolism in many species. 
These enzymes are involved in the detoxification of not only endogenous molecules, 
but also various exogenous substances, such as biogenic amines, 1 cytostatic agents, 2 
and steroids. 3 A substantial fraction of the CYP family is CYP3 A, which comprises up 
to 30% of the total liver CYP enzyme pool in humans. 4 Moreover, its 3A4 isoenzyme 
is believed to be responsible for approximately 60% of metabolic drug changes. 5 

Notwithstanding the membrane-associated state of mammalian CYPs and difficulties 
in their crystallization, significant progress has been made recently in this direction by 
obtaining precise and detailed information about CYP structures. 6 " 8 In addition, a number 
of three-dimensional quantitative structure-activity relationship (3D-QSAR) studies, 
including comparative molecular field analysis (CoMFA) and comparative molecular 
similarity index analysis (CoMSIA), have been performed on the different CYP inhibitors 
to predict, correlate, and interpret experimentally determined inhibition values and assess 
substrate selectivity. 9-11 These methods are based on the assumption that the changes in 
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inhibition activity of chemical substances are related to changes 
in structural properties represented by molecular fields. 12 

Several flavonoids have been shown to modulate the CYP 
system and alter the pharmacokinetics and pharmacody- 
namics of many commonly prescribed drug substances. 1314 
These water-soluble glycosidic derivatives of flavylium salts 
are most prevalent in various fruit and vegetables, such as 
cabbage, grapes, and berries. 15 It was previously reported 
by Dreiseitel et al that CYP3A4 inhibition by a series of 
anthocyanin derivatives, including their aglycones and 
proanthocyanidins, is achieved in a concentration-dependent 
manner. 14 However, it was suggested by the same authors that 
their inhibitory potencies were adversely correlated with the 
number of sugar moieties responsible for a decrease in the 
effects of anthocyanidins on CYP3A4. 14 

The antimutagenic activity of anthocyanidins was earlier 
established by Gasiorowski et al, whose results revealed that 
these compounds, isolated from the fruit of Aroma melano- 
carpa, sufficiently inhibit the mutagenic activity of benzo(a) 
pyrene and 2-aminofluorene in the Ames test. 16 Hence, CYP 
inhibition by anthocyanin compounds might be considered 
as antimutagenic, given that these enzymes contribute to 
carcinogenesis in several organs 17 " 20 and increase DNA tox- 
icity and damage. 21 

Therefore, in order to evaluate a series of anthocyanin 
derivatives as CYP3A4 inhibitors, two computational 
approaches were used in the present study, ie, 3D-QSAR 
and molecular docking techniques, to analyze and match 
experimentally derived CYP3A4 half maximal inhibitory 
concentration (IC 50 ) values to their predicted ones together 
with the binding affinity modes. To our knowledge, this work 
presents the first 3D-QSAR and docking methods that might 
be of interest to determine the most appropriate CoMFA and 
CoMSIA prediction models for selective CYP inhibition and 
to block the mutagenic and carcinogenic activity mediated 
by CYP activation mechanisms. 

Computational methods 

The chemical formulae of 16 anthocyanin derivatives were 
obtained from research papers published elsewhere. 14,18 The 
two-dimensional structures of the compounds were drawn using 
the MarvinSketch version 5.2 program (ChemAxon, Budapest, 
Hungary). After the two-dimensional sketches were converted 
into three-dimensional images, explicit hydrogen molecules 
were added and recorded in PDB (Protein Data Bank) format; 
3D-QSAR modeling was done using Sybyl-X 1 . 1 software (Tri- 
pos International, St Louis, MO, USA). Rigid body alignment 
of anthocyanin derivatives was performed using a maximum 



common substructure by the PyMol software (Schrodinger 
LLC, Rockville, MD, USA) 22 to build the CoMFA and CoMSIA 
models. CoMSIA similarity indices (A pK ) for a molecule j with 
atoms i at a grid point q were calculated as follows: 

i=l 

where AT represents the following physicochemical properties: 
steric and electrostatic descriptors; co piobt:K indicates probe 
atom; a is an attenuation factor; co. K is the actual value of the 
physicochemical property K of atom z; and r^iq is the mutual 
distance between the probe atom at grid point q, and atom i 
of the test molecule. 

The compounds were divided into a training dataset (com- 
pounds 1-12, Figure 1) and test dataset (compounds 13-16, 
Figure 2) using a random selection method that is part of the 
Strike 1.9 module integrated in the Maestro 9.1 molecular 
modeling suite (Schrodinger LLC, Rockville, MD, USA). A 
75% compound separation irrespective of inhibition activity 
or structural composition was achieved for the whole dataset. 
Gasteiger-Hiickel charges were added before the molecular 
field calculations and model elaboration. Energy minimization 
was performed using the Tripos force-field 23 24 with a distance- 
dependent conjugate gradient algorithm. The minimum gra- 
dient of 0.05 kca^mok 1 was set as a convergence criterion. 24 
Default values were used with a 2.0 A grid spacing using the 
common sp 3 carbon atom hybridization with a +1 point charge 
as a probe to investigate the steric and electrostatic interactions 
at the lattice grid points. Meanwhile, a default value of 0.3 
was used as the attenuation factor, and a distance-dependent 
Gaussian-type functional form was used between the grid 
point q and each atom i in the molecular structure. This set- 
ting can avoid singularities at the atomic positions and the 
dramatic changes of potential energy caused by the grid in 
surface proximity. 25 The column filtering value was set to 2.0 
kca^mok 1 . Statistical analysis was performed in the Sybyl-X 
program (Tripos International, St Louis, MO, USA) using 
the partial least square method. Crossvalidated (q 2 ) and non- 
crossvalidated (r 2 ) values were determined for the models using 
linear regression analysis with variances represented as the 
standard error of estimation. 26 For the number of components, 
a consecutive default (six components) was provided in the 
Sybyl-X program based on the crossvalidated accuracy of the 
model; the resulting model q 2 might be subject to overfitting 
by parameter selection. Therefore, we reduced this number to 
5 and increased the statistical significance of the q 2 (0.6) and 
r (0.8) values, which were previously considered to be statisti- 
cally significant even at 0.3 for q 2 and 0.7 for r 2 . 26 
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Pelargonidin 

Figure I Chemical structure of anthocyanin derivatives used as a training set for the three-dimensional quantitative structure-activity relationship study. 



The electron diffraction of human microsomal CYP3A4 
as a holostructure (PDB ID: 1TQN) at 2.05 A resolution 
with the heme cofactor was retrieved from the Protein Data 
Bank as a PDB file. 27 Structural refinement and molecular 
docking studies were performed on the CYP3A4 crystal 
structure using DockingServer (Virtua Drug Ltd, Budapest, 



Hungary), which allows for the carrying out of highly effi- 
cient and robust docking calculations in one comprehensive 
web service. 28 The ligand preparation included Gasteiger 
partial charges assignment and rotatable bonds definition 
before all nonpolar hydrogen atoms were merged. All 
necessary solvation parameters were added with the aid of 
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AutoDock tools (Scripps Research Institute, San Diego, CA, 
USA). 29 Rigid-flexible molecular docking was applied to 
the center of the heme-associated binding cavity using the 
following Cartesian coordinates: x =-15.67 A, y =-22.49 A, 
and z =-1 1.2 A. The AutoDock output results represented 
the docking scores as Gibbs free energy of binding (AG) 
values, further converted to the predicted inhibition constants 
(K. p. Affinity grid maps of 20x20x20 A grid points, and 
0.375 A spacing were generated using the AutoGrid pro- 
gram (Scripps Research Institute, San Diego, CA, USA). 29 
AutoDock parameter set-dependent and distance-dependent 
dielectric functions were used in the calculation of the van 
der Waals and electrostatic terms, respectively. Docking 
simulations were performed using the Lamarckian genetic 
algorithm and the Solis and Wets local search method. 30 Initial 
position, orientation, and torsions of the ligand molecules 
were set randomly. All rotatable torsions were released dur- 
ing docking. Each docking experiment was derived from ten 
different runs that were set to terminate after a maximum of 
250,000 energy evaluations while the population size was set 
to 150. During the search, a translational step of 0.2 A, and 
quaternion and torsion steps of 5 were also applied. 

The IC 50 values were obtained from the study reported by 
Dreiseitel et al, as determined when investigating the inhibi- 
tion of CYP3 A4 activity in a concentration-dependent manner 
using a chemiluminescent assay. 14 The biological data was 
further converted to pK. (negative decimal logarithm of 
experimental inhibition constant) parameters using the pIC 50 
value calculator 31 according to the Michaelis-Menten enzyme- 
substrate interaction equation for noncompetitive species: 32 



Kj = IC 50 , when S = K m orS » K m or S « K m 
where S is the substrate concentration and K is the 

m 

Michaelis-Menten constant. Molecular graphics and visu- 
alizations were performed using Chimera version 1.6.2 
software (Resource for Biocomputing, Visualization, and 
Informatics, San Francisco, CA, USA). The electrostatic 
potential maps were calculated with DelPhi version 5.1 
(Computational Biophysics and Bioinformatics, Clemson 
University, SC, USA) using a finite difference solution to 
the Poisson-Boltzmann equation. 33 The interactive two- 
dimensional diagrams were constructed from Cartesian three- 
dimensional coordinates with the aid of Molecular Operating 
Environment 20 10. 10 software (Chemical Computing Group 
Inc., Montreal, QC, Canada). Statistical analyses were con- 
ducted using a linear regression analysis followed by graphi- 
cal representation with the aid of GraphPad Prism version 4 
(GraphPad Software, San Diego, CA, USA). 

Results and discussion 

The statistical parameters of the standard CoMFA model 
were constructed using rigid body alignment with all 
charges assigned to the molecules. To emphasize the dif- 
ferences between the minimal-energy conformations of 
anthocyanin derivatives, we aligned these compounds with 
the pelargonidin structure used as a common scaffold, and 
measured the root-mean-square deviation values. The root- 
mean-square deviation parameters significantly correlated in 
an inverse manner with the experimentally determined inhibi- 
tory constants (pK. ); structural deviation decreased 
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Figure 3 (A) Rigid body alignment of the anthocyanin compounds used in this study. Hydrogen atoms are omitted for clarity. A bar chart for the compound numbers (B) 
and schematic data smoothing (C) were utilized to compare the values and produce a fitting curve for the RMSD and pK i(exp) comparison. 
Abbreviations: RMSD, root-mean-square deviation; pK ijexp) , negative decimal logarithm of experimental inhibition constant. 



with an increase in compound activity and vice versa 
(Figure 3A-C). The most deviated substances (root-mean- 
square deviation >0. 1 A), such as cyanidin-3,5-diglucoside, 
cyanidin-3-rutinoside, cyanidin-3-galactoside, malvidin-3,5- 
diglucoside, andpeonidin-3-glucoside, had additional sugar 
moieties in their structure that mainly contributed to the 
conformational mismatch. The sugar residues changed their 
spatial orientation as a result of significant conformational 
rearrangements caused by rotations around the col (C2-C3- 
O-sugar) and co2 (C6-C5-0-sugar) dihedral angles. 

The q 2 , r 2 , F, and standard error of estimation values were 
computed as denned in the Sybyl-X suite. The training set 
was preliminarily checked for outliers. Examination of the 
residuals from the crossvalidated predictions indicated that 
there were no outliers in the CoMFA model. According to the 
results from the CoMFA analysis shown in Tables 1 and 2, 
the crossvalidated partial least square (q 2 ) was found to be 
0.795 with five components, indicating that the CoMFA 
model is statistically significant and highly predictive. The 



non-crossvalidated regression value showed a conventional r 2 
of 0.962, an F of 250.8, and a standard error of estimation of 
0.083 (P <0.000 1 ), suggesting that the model could correctly 
predict compound activity in 96% of cases (Figure 4A). The 
contributions from the steric and electrostatic fields were 

Table I Results from the three-dimensional quantitative 
structure-activity relationship analysis for the training dataset 



CoMFA CoMSIA 

Cross-validated 

q 2 0.795 0.687 

Number of components 5 5 

Column filtering (kcal/mol) 2.0 2.0 
Non-crossvalidated 

r 2 0.962 0.948 

SEE 0.083 0.09S 

F 250.8 184.2 

Steric 0.676 0.443 

Electrostatic 0.324 0.557 



Abbreviations: SEE, standard error of estimation; CoMFA, comparative molecular 
field analysis; CoMSIA, comparative molecular similarity index analysis. 
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Table 2 Predicted affinities for CoMFA and CoMSIA models of anthocyanin derivatives 



Compound 


IC 50 OlM) 


P K i(exp) 


P^i (CoMFA) 


Residual 


P^i (CoMSIA) 


Residual 


Cyanidin 


20.8 


4.68 


4.722 


0.042 


4.694 


0.014 


Cyanidin-3,5-diglucoside 


167.5 


3.76 


3.781 


0.021 


3.804 


0.058 


Cyanidin-3-2alactoside 


80.2 


4.095 


4.197 


0.102 


4.193 


0.098 


Cyanidin-3-glucoside 


105.3 


3.98 


4.075 


0.095 


4.058 


0.078 


Cyanidin-3-rutinoside 


96.1 


4.02 


4.022 


0.002 


3.997 


-0.02 


Delphinidin-3-glucoside 


74.1 


4.13 


4.174 


0.044 


4.083 


-0.047 


Malvidin-3,S-diglucoside 


225.5 


3.65 


3.561 


-0.089 


3.572 


-0.078 


Malvidin-3-galactoside 


84.3 


4.07 


4.09 


0.002 


4.206 


0.136 


Malvidin-3-glucoside 


80 


4.096 


3.99 


-0. 1 06 


4.005 


-0.091 


Pelargonidin 


12.2 


4.91 


4.777 


-0.133 


4.75 


-0.16 


Peonidin 


16 


4.795 


4.721 


-0.074 


4.729 


-0.066 


Petunidin 


23.4 


4.63 


4.706 


0.076 


4.746 


0.1 16 


Delphinidin 


46.5 


4.333 


4.697 


0.364 


4.658 


0.325 


Malvidin 


14 


4.85 


4.709 


-0.141 


4.72 


-0.13 


Pelargonidin-3,S-diglucoside 


249 


3.603 


3.61 1 


0.008 


3.737 


0.134 


Peonidin-3-glucoside 


79.2 


4.101 


3.993 


-0.108 


3.954 


-0.147 


Abbreviations: CoMFA, comparative molecular field analysis; 
negative decimal logarithm of experimental inhibition constant 


CoMSIA, comparative molecular similarity index analysis; IC S0 , half maximal inhibitory concentration; pK. . ., 
pK. (CoMFA) , negative decimal logarithm of CoMFA-predicted inhibition constant; pK. (CoMslA) , negative decimal 



logarithm of CoMSIA-predicted inhibition constant. 



detected to be 67% and 33%, respectively. This observation 
was unexpected given that the anthocyanin derivatives used 
to build up the CoMFA model were all positively charged. 
Therefore, their positive charges contribute mainly to elec- 
trostatic fields but not to the steric hindrance, which might 



be due to the volume restrictions associated with the narrow 
binding cavities. 

In the next step, the CoMFA model was validated by the 
external test set of four compounds, ie, delphinidin, malvidin, 
pelargonidin-3,5-diglucoside, and peonidin-3-glucoside. 
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Figure 4 (A) Predicted and experimental pK. values from the comparative molecular field analysis model correlated with each other by linear regression analysis. The 
comparative molecular field analysis maps are shown in respect to pelargonidin as a reference molecule. (B) Steric maps are colored in green for steric bulk areas with an 
increased activity. (C) Electrostatic maps are colored in blue for positively charged areas with an increased activity and red ones for negatively charged areas. Hydrogen 
atoms are omitted for clarity. 

Abbreviations: pK. (exp) , negative decimal logarithm of experimental inhibition constant; pK. (CoMFA) t negative decimal logarithm of CoMFA-predicted inhibition constant. 
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It can be seen from Tables 2 and 3 that the CoMFA model 
could correctly predict the compound affinity with an r 2 of 
0.821, an F of 9.1 17, and a standard error of estimation of 
0.281 (P<0.0939), suggesting that the CoMFA model could 
accurately predict compound activity in 82% of cases. The 
pelargonidin molecule as the strongest CYP3A4 inhibitor 
(IC 50 12.2 uM) is shown in Figure 4B and C with the CoMFA 
steric and electrostatic maps. The green maps represent the 
areas where increasing bulk would likely lead to an increase 
in activity, while the yellow steric regions (if any) indicate 
areas with decreased compound activity. We observed that 
there is a zone of optimal steric favor that surrounds the 
hydroxyl groups of 4-hydroxyphenyl and chromenylium- 
3,5,7-triol moieties. The blue and red electrostatic maps 
indicate the positively and negatively charged areas with an 
increase and decrease in activity, respectively. Interestingly, 
the large red area was located at the 4-hydroxyphenyl part 
of the molecule, indicating only minor contribution to the 



5.0- 



inhibition effect due to its conformational flexibility. This 
flexibility might be explained by the phenomenon of aglycone 
internal rotation around the 8 angle (01-C2-C1 '-C2'), which 
defines the coplanarity of the A, B, and C rings. In general, 
the maps represent a molecular state that is close to optimal 
for binding to the CYP3A4 protein when it was detected as 
IC 50 in the previous experiments. 14 

The results of the CoMSIA analysis are shown in 
Tables 1 and 2. Similarly to the CoMFA model, we used 
the steric and electrostatic fields in the CoMSIA analysis. 
Partial least square crossvalidation showed a regression 
value (q 2 ) of 0.687, which indicates a good predictive model. 
The optimal number of components (five) was found from 
previous crossvalidation analysis to build up the nonvali- 
dated linear regression model with a high Pearson's correla- 
tion coefficient (r 2 ) of 0.948, an F of 184.2, and a standard 
error of estimation of 0.095 (P<0.0001, Figure 5A). The 
relative contributions from the steric and electrostatic fields 



o Training set 
• Test set 




B 



C 

^^^^^^^ 



Figure 5 (A) Predicted and experimental pK. values from the comparative molecular similarity index analysis model correlated with each other by linear regression analysis. 
The comparative molecular similarity index analysis maps are shown in respect to pelargonidin as a reference molecule. (B) Steric maps are colored in green for steric bulk 
areas with an increased activity and yellow ones, where steric bulk will likely to lead to a decrease in activity. (C) Electrostatic maps are colored in blue for positively charged 
areas with increased activity and red ones for negatively charged areas. Hydrogen atoms are omitted for clarity. 

Abbreviations: pK. (exp) , negative decimal logarithm of experimental inhibition constant; pK |(CoMSIA) , negative decimal logarithm of CoMSIA-predicted inhibition constant. 
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were detected to be 44% and 56%, respectively. In this case, 
all molecular charges contributing to the electrostatic fields 
were dominating over the steric effect. The latter parameter 
was decreased as a result of the relatively large size of the 
CYP3A4 binding cavity that might release the steric hin- 
drance in various sizes and shapes of ligand structures. 

It can also be seen from Tables 2 and 3 that the CoMSIA 
model was validated by the external test set of the same four 
compounds with an r 2 of 0.812, an F of 8.636, and a standard 
error of estimation of 0.263 (P<0.0989), suggesting that the 
CoMSIA model could accurately predict compound activity 
in 81% of cases. 

As compared with the CoMSIA model, the CoMFA 
model appeared to be more predictive, partly due to the small 
deviation molecular alignment and also the fact that CoMSIA 
relies on explicit hydrophobic and hydrogen bond donor or 
acceptor field descriptors. 

The CoMSIA model is shown in Figure 5B and C with the 
pelargonidin structure used as a reference molecule. The green 
maps indicate areas where addition of bulky steric chemical 
groups would presumably impose an increase in compound 
activity, while the yellow ones would reverse this effect, as 
already observed in the previous CoMFA model. Additionally, 
there was an increase in positively charged areas with higher 
activity colored in blue in the vicinity of the hydroxyl group 
in the 4-hydroxyphenyl moiety and the hydroxyl group in the 
3'-position of the C-ring of the flavonoid backbone. 

To gain some insight into the binding characteristics of 
anthocyanin derivatives as CYP3A4 inhibitors, we docked 
them into the heme cofactor-associated binding cavity of 
CYP3A4. During the docking process, the protein was 
considered to be rigid while the ligands were flexible. The 
structure of CYP3 A4 turned out to be much more open in the 
proximity of the heme ion, resulting in more uniform distribu- 
tion of the cavity volume and easier ligand binding. 27 

Eventually, only 40% of the total binding cavity area was 
occupied by all docked compounds, reflecting the relatively 
large size of the active cavity, which is a well established 
characteristic of CYP3A4. 6 34 Thus, this enhances simultane- 
ous molecular occupancy, which occurs near the catalytic 



Table 3 Results from the three-dimensional quantitative 
structure-activity relationship analysis for the test dataset 





CoMFA 


CoMSIA 


r 2 


0.821 


0.812 


SEE 


0.281 


0.263 


F 


9.1 17 


8.636 



Abbreviations: SEE, standard error of estimation; CoMFA, comparative molecular 
field analysis; CoMSIA, comparative molecular similarity index analysis. 
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Figure 6 (A) Rigid-flexible docking of anthocyanin derivatives into the binding site of 
the cytochrome P450 3A4 molecule. The molecular surface is divided by the frontal 
plane to visualize a binding channel of the protein structure. Red and blue colors 
are depicted for negative and positive electrostatic potentials; while zero potential 
is in white. Ligands are colored according to their atom types. Hydrogen atoms are 
omitted for clarity. (B) Relationship between predicted binding constants calculated 
from the docking studies with cytochrome P450 3A4 and experimental ones 
determined by a sensitive chemiluminescence assay for 1 6 anthocyanin derivatives. 
Abbreviations: pK. , negative decimal logarithm of experimental inhibition 
constant; pK. (prEd| , negative decimal logarithm of predicted inhibition constant. 

center of the enzyme that contributes to heterotrophic cooper- 
ativity by facilitating alternative binding modes for C YP3 A4 
inhibitors, as shown experimentally in many studies. 35-37 
However, anthocyanin derivatives occupy different bind- 
ing regions within the same binding cavity (Figure 6A), 
with different amino acid residues involved because of their 
dihedral rotation and conformational mismatch. 

In order to find common similarities in the CYP3A4 
ligand interaction modes for the strongest inhibitors, we 
selected pelargonidin as the best experimental CYP3A4 
inhibitor with a pK i(exp) of 4.91 and cyanidin as the best pre- 
dicted CYP3A4 binder with a pK. (pred) of 5 .2 1 . Taken together, 
we found that the Phe447 and Phe435 aromatic rings of 
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Figure 7 Two-dimensional interaction maps showing important amino acid residues playing a role in binding of pelargonidin (A) and cyanidin (B) molecules to cytochrome 
P450 3A4. 



CYP3A4 interact with these two molecules by establishing 
k-k stacking with the A ring for pelargonidin and the B ring 
for cyanidin (Figure 7A and B). Similarly, previous theoreti- 
cal and experimental studies have elucidated the effect of 
k-k stacking between phenylalanine groups and aromatic 
rings of various CYP3A4 substrates and inhibitors. 38 ' 39 In 
addition, the cyanidin hydroxyl functional group at position 4 
of the B ring (3,4-dihydroxyphenyl moiety) forms an H-bond 
with the backbone acceptor oxygen atom of Pro434, thereby 
contributing to an increase in binding affinity. 

Table 4 Predicted binding affinities for anthocyanin derivatives 



Compound 



AG 

(kcal*mo|-') 



(pred) 

OlM) 



P K i ( pr.d) 



Cyanidin 


-7.1 


6.21 


5.21 


Cyanidin-3,S-diglucoside 


-2.86 


7,980 


2.09 


Cyanidin-3-galactoside 


-4.79 


309.28 


3.51 


Cyanidin-3-glucoside 


-S.S 


93.65 


4.03 


Cyanidin-3-rutinoside 


-6.1 


33.71 


4.47 


Delphinidin-3-glucoside 


-6.31 


23.72 


4.62 


Malvidin-3,S-diglucoside 


-2.4 


1,735 


2.76 


Malvidin-3-galactoside 


-6.0 


39.81 


4.4 


Malvidin-3-glucoside 


-5.28 


134.71 


3.87 


Pelargonidin 


-6.76 


1 1.07 


4.95 


Peonidin 


-6.91 


8.56 


5.07 


Petunidin 


-6.7 


12.17 


4.91 


Delphinidin 


-6.5 


17.32 


4.76 


Malvidin 


-6.91 


8.59 


5.07 


Pelargonidin-3,5-diglucoside 


-4.1 1 


971.9 


3.01 


Peonidin-3-glucoside 


-6.82 


10.07 


4.99 



Abbreviations: AG, Gibbs free 
constant; pK i(pred) , negative decimal 



energy of binding; K. predicted inhibition 
logarithm of predicted inhibition constant. 



In the molecular docking procedure, a cysteine amino acid 
(residue number 442) presents at the binding pocket, which is 
a strong nucleophile and participates easily in covalent inter- 
actions with other compounds. Furthermore, there are some 
reports in the literature of cysteine residues in the CXXCH 
motif of CYP acting to form covalent thioether bonds. 40 - 41 There- 
fore, there might be some opportunity for a covalent interaction 
to occur between the Cys442 residue and the ligands. 

The next step was to explore whether or not the experi- 
mental values (K , , or pK , ,) can correlate with the 

v l (exp) r l (exp)' 

AG-derived inhibition constants (K i( .. or pK. (jjrcd) , Table 4). 
The Michaelis-Menten enzyme-substrate interaction equa- 
tion was applied to the inhibitory activities represented as IC 50 
values. Consequently, a model with r 2 of 0.66 was obtained 
for 16 compounds (Figure 6B), which indicates that the 
correlation was statistically significant, for the theoretical 
molecular docking study, thus demonstrating the applicability 
of this method. This model eventually confirmed the previ- 
ous observations found in our CoMFA/CoMSIA models and 
previously published experimental results 14 42 that anthocya- 
nin derivatives, generally occurring as 3-O-glycosides, have 
bioactivity attributable to the aglycone but not to the sugar 
moiety or other binding species. 

Conclusion 

In this paper, we describe the development of 3D-QSAR 
models with CoMEA and CoMSIA methods and molecular 
docking procedures to explain the inhibition of anthocyanin 
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derivatives on CYP3A4. Initial attempts at using a ligand- 
based method for construction of the CoMFA and CoMSIA 
models were based on the 1 6 reported anthocyanin derivatives 
through additions of hydroxyl groups and glycoside residues 
in different positions and this resulted in models with over 
80% predictability. Switching to the protein-based docking 
method, the predicted binding energy from AutoDock pro- 
vided a correlation with the experimental IC 50 values of at 
least 66%. The n-n stackings with Phe447 and Phe435 were 
indicated as key interactions for the anthocyanin derivatives. 
The hydrogen bond interaction between Pro434 and the 
hydroxyl group on the cyanidin B ring was also considered 
to contribute to the increased binding affinity. 

Overall, in the present investigation, the number of sugar 
residues predicted and confirmed a "structural prerequisite" 
for a decline in the anthocyanin inhibition effect on CYP3 A4, 
which underscores the urgent need for more detailed analysis 
of the structure-activity relationship. 

Finally, we emphasize the importance of structure- 
activity relationship models in the development of CYP 
inhibitors. These models could be a valuable tool in the 
design of drug-like molecules, which are targeted for use as 
inhibitors of cytochromes, which are the enzymes responsible 
for carcinogen activation and cellular DNA damage. 
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